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ABSTRACT. In this paper we explore various interconnections between rich words, Sturmian words, 
and trapezoidal words. Rich words, first introduced in lfT31 by second and third authors together with 
J. Justin and S. Widmer, constitute a new class of finite and infinite words characterized by having 
the maximal number of palindromic factors. Every finite Sturmian word is rich, but not conversely 
IfTTII . Trapezoidal words were first introduced by the first author in studying the behavior of the 
subword complexity of finite Sturmian words. Unfortunately this property does not characterize 
finite Sturmian words. In this note we show that the only trapezoidal palindromes are Sturmian. 
More generally we show that Sturmian palindromes can be characterized either in terms of their 
subword complexity (the trapezoidal property) or in terms of their palindromic complexity. We also 
obtain a similar characterization of rich palindromes in terms of a relation between palindromic 
complexity and subword complexity. 



1. Introduction 

In [fTTTl . X. Droubay, J. Justin, and G. Pirillo showed that a finite word W of length | W\ has at 
most \W\ + 1 many distinct palindromic factors, including the empty word. In lfl31 . the second 
and third authors together with J. Justin and S. Widmer initiated a unified study of both finite and 
infinite words characterized by this palindromic richness property. Accordingly we say that a finite 
word W is rich if and only if it has | W\ + 1 distinct palindromic factors, and an infinite word is rich 
if all of its factors are rich. Droubay, Justin and Pirillo showed that all Episturmian words (in par- 
ticular all Sturmian words) are rich. Other examples of rich words are complementation symmetric 
sequences |fT5l , symbolic codings of trajectories of symmetric interval exchange transformations 
Ifl3l[l4ll . and certain /3-expansions where (3 is a simple Parry number JT). 

Let u be a non-empty factor of a finite or infinite word W. A factor of W having exactly two 
occurrences of u, one as a prefix, and one as a suffix, is called a complete return to u in W. In [[131 . 
the following fact is established: 

Proposition 1. A finite or infinite word W is rich if and only if for each non-empty palindromic 
factor u of W, every complete return to u in W is a palindrome. 

In short, W is rich if all complete returns to palindromes are palindromes. Given a finite or 
infinite word W, let Cwip) (respectively Pw(n)) denote the subword complexity function (respec- 
tively the palindromic complexity function) which associates to each number n > the number of 
distinct factors (respectively palindromic factors) of W of length n. Infinite Sturmian words are 
characterized by both their subword complexity and palindromic complexity. An infinite word W 
is Sturmian if and only if Cw(n) = n+ 1 for each n > 0. In [f!2l . X. Droubay and G. Pirillo showed 
that W is Sturmian if and only if Pwip) = 1 whenever n is even, and P\y( n ) = 2 whenever n is 
odd. In [7], the first author studied the complexity function of finite words W. He showed that if 



Date: February 2, 2008. 

1991 Mathematics Subject Classification. 68R15. 

Key words and phrases. Sturmian words, palindromes, palindromic complexity. 

1 



2 



ALDO DE LUCA, AMY GLEN, AND LUCA Q. ZAMBONI 



W is a finite Sturmian word (meaning a factor of a Sturmian word), then the graph of Cjy(n) as a 
function of n (for < n < \W\) is that of a regular trapezoid: that is CV(n) increases by 1 with 
each n on some interval of length r, then CV(n) is constant on some interval of length s, and fi- 
nally Cw(n) decreases by 1 with each n on an interval of the same size r. Such a word is said to be 
trapezoidal. More precisely, for any word W let us denote by R w the smallest integer p such that 
W has no right special factor of length p, and by K w the length of the shortest unrepeated suffix 
of W . Then we say that W is a trapezoidal word if and only if | W\ = Rw + K w . However, in Q 
the first author shows that the property of being trapezoidal does not characterize finite Sturmian 
words. For instance, the word aaabab is not Sturmian although it is trapezoidalQ 

The main results of this note are to give characterizations of both rich palindromes and Sturmian 
palindromes in terms of the palindromic complexity functions. We also show that every trapezoidal 
word is rich, but not conversely. In the case of rich palindromes we proved 

Theorem 1. Let W be a finite word. Then the following two conditions are equivalent: 

(A) W is a rich palindrome. 

(B) P w {n) + P w {n + l) = C w {n+l)-C w {n) + 2foreachO<n< \W\. 
While for Sturmian palindromes we proved 

Theorem 2. Let W be a word of length N. Then the following three conditions are equivalent: 

(A' ) W is a Sturmian palindrome. 

(B') P w (n) + P W (N -n) = 2 for each < n < N. 

(C) W is a trapezoidal palindrome. 



2. Rich vs Trapezoidal Words 
In this section we show that all trapezoidal words are rich: 
Proposition 2. Let W be a trapezoidal word. Then W is rich. 

Proof. We proceed by induction on \W\. The result is clearly true if \W\ < 2. Suppose every 
trapezoidal word of length less than TV is rich, and suppose that W is trapezoidal (say on the letters 
{a, b}) of length N. Let us suppose to the contrary that W is not rich. Then, by Proposition \T\ 
in W there exists a complete return to some palindrome P which is not a palindrome. Since, 
on a binary alphabet, a complete return to a letter is always a palindrome, we can write (without 
loss of generality) that P = all a with U possibly empty. Since the prefix and suffix of W of 
length iV — 1 are both rich (by the induction hypothesis), it follows that all a is both a prefix and a 
suffix of XV, and that these are the only two occurrences of all a in W. So W itself is the complete 
return to all a which is not a palindrome. In particular W is not a palindrome, which implies that 
\W\ > 2\aUa\ +2. 

It follows that K w = \aUa\ + 1 since alia occurs twice in W and if some longer suffix of 
W occurred more than once in W, then alia would occur at least three times in W. Since W 
is trapezoidal, we have R w + K w = \W\. Now the word W has a period q = \W\ — \aUa\ = 

l ln 0, F. D' Alessandro classified all non-Sturmian trapezoidal words. 

2 An infinite version of Theorem[T]was obtained by the second and third authors together with M. Bucci and A. De 
Luca in [4| using completely different methods. 

A different characterization of Sturmian palindromes was obtained by A. de Luca and A. De Luca in [8 1. See also 

m. 
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R w + K w — (K w — 1) = R w + 1. Let 7i w denote the minimal period of W. Then % w < R w + 1. 
Since for any word W, ir w > R w + 1, it follows that tv w = R w + I. From Proposition 28 of (SI 
we deduce that W is a Sturmian, and hence rich, a contradiction. □ 

Remark 1. We note that the converse is false; in fact aabbaa is rich but not trapezoidal. 

3. Proof of Theorem CD 

We first show that (B) implies (A). We assume W satisfies (B). Taking n = \W\ and using 
P W {\W\ + 1) = C W {\W\ + 1) = and CW(|W|) = 1, we deduce that P W (\W\) = 1, and hence 
W is a palindrome. It remains to show that W is rich. Let S denote the total number of distinct 
palindromic factors of W. We will show that S = \ W\ + 1. Since W itself is a palindrome we have 

\w\-i 

s-i= 

n=0 

Similarly since the empty word is a palindrome we have 

\w\ 

S-l = J2 P w(n) 

n=l 

Thus 

\W\~1 \w\ 
2S-2 = Pw(n) + J2 p w(n) 

n=0 n=l 
\W\-1 

= j2(P w (n)+P w (n + l)) 

n=0 
\W\-1 

= Yl (Cw(n + l)-C w (n) + 2) 

n=0 

= C W (\W\) -C w (0) + 2\W\ 
= 1-1 + 2|W| = 2|W|. 

Hence S — \W\ + 1 as required. 

Next we show that (A) implies (B). We proceed by induction on the length of W. The result 
is easily verified in the case \W\ < 2. Now suppose the result is true for all rich palindromes of 
length less than iV and suppose W is a palindrome of length N. Let V denote the palindrome of 
length N — 2 obtained by removing the first and last letter of W. Since V is also rich (see |fT5l ), 
by the induction hypothesis we have Pv(n) + P v (n + 1) = Cy(n + 1) — Cy(n) + 2 for each 
< n < N -2. 

Let A^ denote the length of a shortest factor U of W which is not a factor of V. Then for 
< n < N - 1 we have P w {n) + P w (n + 1) = C w {n + 1) - C w {n) + 2. 

The word U is either a prefix or a suffix of W. We claim that it is in fact both a prefix and a suffix 
of W, in other words a palindrome. Suppose to the contrary that U is not a palindrome. Without 
loss of generality we may assume that U is a suffix of W. Let U' denote the longest palindromic 
suffix of U. Since \U'\ < N 0l we have U' is also a factor of V. Hence there exists a complete 
return Z of U' which is a proper suffix of W. Since W is rich, Z is a palindrome. Since we are 
assuming that U is not a palindrome and that U' is the longest palindromic suffix of U, it follows 
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that \Z\ > \U\. Since W is a palindrome, Z is also a prefix of W, and hence the proper suffix U of 
Z occurs in V, a contradiction. Thus U is a palindrome, and hence both a prefix and a suffix of W 7 ! 
Thus U is the only factor of W of length iVo which is not a factor of V. Thus we have 

P W (N ) = P V (N ) + 1 and C W (N ) = C V (N ) + 1. 

Since P V (N - 1) + Py(iVo) = C V (N ) - C V (N - 1) + 2, P V (N - 1) = P w (iV - 1), and 
CV(iV - 1) = C W (N - 1), we deduce that 

P W (N - 1) + (Pw(N ) - 1) = (CV(iVo) - 1) - CW(JV - 1) + 2 

and hence 

P w (iVo - 1) + P w (iV ) = C W (N ) - C W (N - 1) + 2 

in other words equality in (B) also holds for n = N — 1 . 

We now claim that the only palindromic suffix of W of length greater than N is W itself. In 
fact, if W admitted a proper palindromic suffix of length greater than N , then U would be a factor 
of V, a contradiction. Thus we have 

(3.1) P w (n) = P v (n) for all N < n < N. 

Also, for each N < n < N, let UX (respectively XU) denote the prefix (respectively suffix) 
of W of length n, where X denotes the reversal of X. Since UX is not a palindrome it follows that 
UX y£ XU. Thus 

(3.2) C w (n) = C v (n) + 2 for all N < n < N. 

We now verify (B) for n = N . Starting with P V (N ) + P V (N + 1) = C V (N + 1) -C V (N ) +2 
we obtain 

(P W {N ) - 1) + P W (N + 1) = (C W (N + 1) - 2) - (C W {N ) - 1) + 2 

and hence 

iV(iV ) + P^(iVo + 1) = C7 w (iV + 1) - C W (N ) + 2. 

We next verify (B) for N < n < N - 2. Starting with P v (n) + P v (n + 1) = C v (n + 1) - 
CV(n) + 2 and using (13.11) and (|3.2I) we obtain 

iV(ra) + + 1) = (CW(n + 1) - 2) - (C^H - 2) + 2 

and hence 

P w (n) + P w {n + 1) = C w {n + 1) - CW(n) + 2. 

It remains to verify (B) for n = N — 1 and n = N.lfW is the constant word, then Pw(N — 1) = 
l,P w (iV) = 1,P W (N + 1) =0,C W (N-1) = 1,C W (N) = l,andCW(iV + l) = 0. Otherwise, 
P W (N-1) = 0,P w (N) = 1,P W (N+1) = 0,C W (N-1) = 2,C W (N) = 1, and C W (N+1) = 0. 
In either case one readily verifies (B) for n = N — 1 and n = N. This completes the proof of 
Theorem [TJ 
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4. Proof of Theorem [2] 

We begin with the following lemma: 

Lemma 1. Let W be a word of length N satisfying either condition of Theorem^ Then W is a 
rich palindrome. Hence by Theorem\l\we have Pw( n ) + Pw( n + 1) — Cw( n + 1) — Cw(n) + 2 
for < n < N. 

Proof. Since any Sturmian word is trapezoidal, by Proposition [2] one has that if W satisfies ei- 
ther condition (A') or (C), then it is rich. Let us suppose that W satisfies condition (B'). Since 
P W (N) = P w (0) = 1, we have W is a palindrome. To see that W is rich, let S = P^(0) + 
iV(l) + -^V(2) + ••• + Pw(N) denote the number of distinct palindromic factors of W. Then 

2S = P w (0) + P w (N)+P w (l) + P w (N-l) + ... + P w (N) + P w (0) 
= 2(N + 1). 
Whence S = N + I = \W\ + I. 

□ 

We note that condition (B') is equivalent to saying that the word P W (0)P W (1)P W (2)...P W (N) 
is a # -palindrome on the alphabet {0,1,2} with respect to the involutory antimorphim 9 defined by 
9(0) = 2, 9(2) = and 9(1) = 1. 

Assume first that W is a Sturmian palindrome. For < n < N — 1, set D w (n) = Cw(n + 1) — 
C w (n). In Q, the first author showed that the word D w (0)D w (l)D w (2)....D w (N - 1) is of the 
form l r s (— l) r . In other words, that W is a trapezoidal word: CV(n) increases by 1 with each n 
on an interval of length r, then stabilizes, and eventually decreases by 1 with each n on an interval 
of the same size r. The trapezoidal property of W together with the preceding lemma imply that 
the word P\v(0)Pw(^)Pw(^)---Pw(N) begins with a block of the form 121212 . . . (corresponding 
to the interval of length r on which Cw(n + 1) — Cw(n) = 1), and terminates with a block of 
the form ... 010101 (corresponding to the interval on which CV(n + 1) — Cw(n) = —1), and 
moreover by the trapezoidal property, these two blocks are of the same length. Between these 
two blocks is either a block of the form 11 ... 11 or of the form 202 . . . 020 corresponding to the 
interval on which Cw(n + 1) — C\y(n) = 0. Hence W satisfies condition (B'). 

Next suppose W satisfies (B'). First observe that for each n we have Pw(n) E {0, 1, 2}, and 
Pw ( 1 ) 7^ . If Pw ( 1 ) = 1 , then W is equal to the constant word, and hence a Sturmian palindrome. 
Next suppose Pw(l) = 2. In this case W is a binary palindromic word, say on the alphabet {a, b}. 
To show that W is Sturmian, it suffices to show that W is balanced, i.e., given any two factors 
u and v of W of the same length, we have ||it| a — \v\ a \ < 1, where \u\ a denotes the number of 
occurrences of the letter a in u. Suppose to the contrary that W is not balanced. Then, it is well 
known (see for instance Proposition 2.1.3 in [3]) that there exists a palindrome U such that both 
alia and bUb are factors of W. Thus W contains two distinct palindromes of the same length, 
which implies that \U\ is odd. For otherwise, if \U\ were even, then taking k = 2~ 1 \U\ + 1, we 
have Pw(2k) = 2, and hence by (B') Pw(N — 2k) = 0, and hence P W (N) = 0, a contradiction. 
Since W is a palindrome and contains both all a and bUb, the palindrome U must have at least two 
complete returns in W, one beginning in Ua, which we denote by X, and one beginning in Ub, 
which we denote by Y. Since W is rich we have both X and Y are palindromes with X ^ Y. 

If both |X| and \Y\ are greater than \U\ + 1, then both \X\ and \Y\ must be even. In fact, 
suppose to the contrary that \X\ were odd. Then \X\ > \U\ +2. But then W would contain three 
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palindromes of length \U\ + 2, namely aUa, bl/b, and the central palindromic factor of length 
\U\ + 2 of X which is necessarily distinct from both all a and bUb since X cannot contain an 
occurrence of U other than as a prefix and as a suffix. The same argument shows that \ Y\ must be 
even. Without loss of generality we can assume \X\ < \Y\. Then, as X and the central palindrome 
of Y of length \X\ are distinct, it follows that W contains two distinct palindromes of even length 
\X\. Thus, iV(|X|) = 2, and hence Pw(N — \X\) = 0, and hence Pw(N) = 0, a contradiction. 

Thus it remains to consider the case in which either \X\ or \Y\ is equal to \U\ + 1. Without loss 
of generality suppose |X| = \U\ + 1. This means that X = Ua = aU and hence U is the constant 
word U = a'* 7 '. In this case \Y\ > \U\ + 2 and by the previous argument must be even. But then 
X and the central palindrome of Y of length \X\ are two distinct palindromic factors of W of even 
length, a contradiction. Thus we have shown that conditions (A') and (B') are equivalent. 

Now we show that (A') is equivalent to (C). The first author showed in [7] that every finite 
Sturmian word is trapezoidal. Thus (A') implies (C). To see that (C) implies (A'), we proceed by 
induction on \ W\. The result is clearly true if \W\ < 2. Next suppose the result is true for \ W\ < 
N and let W be a trapezoidal palindrome of length N. Since a trapezoidal word is necessarily 
on a two-letter alphabet, say {a, b}, we can write, without loss of generality, W = aVa. Then 
V is a trapezoidal palindrome, since factors of trapezoidal words are trapezoidal (see 0). By 
the induction hypothesis, V is a Sturmian palindrome. If W is not Sturmian, then there exists 
a palindrome U such that aUa and bUb are factors of W. Since V is Sturmian, we have aUa 
is both a prefix and suffix of W, and bUb is a factor of V. Since in V, all complete returns to 
U are palindromes, between an occurrence of bUb in V and the suffix aU of V there must be 
an occurrence of bUa. Since V is a palindrome we have aUb is also a factor of V. Hence each of 
aU a, bUb, aUb, and bUa is a factor of W. This implies that both aU and bU are right special factors 
of W, a contradiction since the trapezoidal property implies that for any < n < \W\ , there exists 
at most one right special factor of W of length n. Thus W must be Sturmian. This concludes our 
proof of Theorem [21 

Remark: A. De Luca |[T0l suggested the following alternate simple proof that (C) implies (A'): 
Let W be a trapezoidal palindrome. Without loss of generality we can assume that \W\ > 2, for 
otherwise the result is clear. Let U denote the longest proper palindromic suffix of W. Since W is 
a palindrome, U is the longest border of W, whence | W\ — ttw + \U\-By Proposition^ W is rich, 
hence U is the longest repeated suffix of W. Thus Kw = \U\ + 1. Since W is trapezoidal we have 
that 7r w = \W\-\U\ = R w + K w - \U\ = R w + 1. By Proposition 28 of we deduce that W 
is Sturmian. 
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